home *** CD-ROM | disk | FTP | other *** search
- **********************************************************
- * Eukaryotic putative RNA-binding region RNP-1 signature *
- **********************************************************
-
- Many eukaryotic proteins that are known or supposed to bind single-stranded
- RNA contain one or more copies of a putative RNA-binding domain of about 90
- amino acids [1,2]. This region has been found in the following proteins:
-
- ** Heterogeneous nuclear ribonucleoproteins **
- - hnRNP A1 (helix destabilizing protein) (twice).
- - hnRNP A2/B1 (twice).
- - hnRNP C (C1/C2) (once).
- - hnRNP E (UP2) (at least once).
-
- ** Small nuclear ribonucleoproteins **
- - U1 snRNP 70 Kd (once).
- - U1 snRNP A (once).
- - U2 snRNP B'' (once).
-
- ** Pre-RNA and mRNA associated proteins **
- - Protein synthesis initiation factor 4B (eIF-4B) [3], a protein essential
- for the binding of mRNA to ribosomes (once).
- - Nucleolin (4 times).
- - Yeast single-stranded nucleic acid-binding protein (gene SSB1) (once).
- - Yeast protein NSR1 (twice). NSR1 is involved in pre-rRNA processing; it
- specifically binds nuclear localization sequences.
- - Poly(A) binding protein (PABP) (4 times).
-
- ** Others **
- - Drosophila sex determination protein Sex-lethal (Sxl) (twice).
- - Drosophila sex determination protein Transformer-2 (Tra-2) (once).
- - Drosophila 'elav' protein (3 times), which is probably involved in the RNA
- metabolism of neurons.
- - Human paraneoplastic encephalomyelitis antigen HuD (3 times) [4], which is
- highly similar to elav and which may play a role in neuron-specific RNA
- processing.
- - Drosophila 'bicoid' protein (once) [5], a segment-polarity homeobox protein
- that may also bind to specific mRNAs.
- - La antigen (once), a protein which may play a role in the transcription of
- RNA polymerase III.
- - The 60 Kd Ro protein (once), a putative RNP complex protein.
- - A maize protein induced by abscisic acid in response to water stress, which
- seems to be a RNA-binding protein.
- - Three tobacco proteins, located in the chloroplast [6], which may be
- involved in splicing and/or processing of chloroplast RNAs (twice).
- - X16 [7], a mouse protein which may be involved in RNA processing in
- relation with cellular proliferation and/or maturation.
- - Nucleolysins TIA-1 and TIAR (3 times) [8] which possesses nucleolytic
- activity against cytotoxic lymphocyte target cells. may be involved in
- apoptosis.
- - Yeast RNA15 protein, which plays a role in mRNA stability and/or poly-(A)
- tail length [9].
-
- Inside the putative RNA-binding domain there are two regions which are highly
- conserved. The first one is a hydrophobic segment of six residues (which is
- called the RNP-2 motif), the second one is an octapeptide motif (which is
- called RNP-1 or RNP-CS). The position of both motifs in the domain is shown in
- the following schematic representation:
-
- xxxxxxx######xxxxxxxxxxxxxxxxxxxxxxxxxxxxx########xxxxxxxxxxxxxxxxxxxxxxxxx
- RNP-2 RNP-1
-
- As a signature pattern for this type of domain we have used the RNP-1 motif.
-
- -Consensus pattern: [RK]-G-{EDRKHPCG}-[AGSCI]-[FY]-[LIVA]-x-[FYM]
- -Sequences known to belong to this class detected by the pattern: ALL, except
- for the 60 Kd Ro protein where the RNP-1 pattern starts with His-Leu instead
- of (Arg/Lys)-Gly, and X16 which has Pro in the first position of the pattern.
- -Other sequence(s) detected in SWISS-PROT: 20.
-
- -Note: in most cases the residue in position 3 of the pattern is either Tyr or
- Phe.
- -Note: this pattern will fail to detect the first occurrence of the pattern in
- the poly(A) binding protein because it has Leu in the first position of the
- pattern. It will fail to detect one of the four copies in nucleolin because
- it has Lys instead of Gly in position 2 of the pattern. It will not detect
- the first copy of the domain in Drosophila Sex-lethal because it has a Phe in
- the first position of the pattern. Finally it will also fail to pick-up the
- first copy of the pattern in elav and HuD because they have Leu in the first
- position of the pattern.
-
- -Last update: October 1993 / Pattern and text revised.
-
- [ 1] Bandziulis R.J., Swanson M.S., Dreyfuss G.
- Genes Dev. 3:431-437(1989).
- [ 2] Dreyfuss G., Swanson M.S., Pinol-Roma S.
- Trends Biochem. Sci. 13:86-91(1988).
- [ 3] Milburn S.C., Hershey J.W.B., Davies M.V., Kelleher K., Kaufman R.J.
- EMBO J. 9:2783-2790(1990).
- [ 4] Szabo A., Dalmau J., Manley G., Rosenfeld M., Wong E., Henson J.,
- Posner J.B., Furneaux H.M.
- Cell 67:325-333(1991).
- [ 5] Rebagliati M.
- Cell 58:231-232(1989).
- [ 6] Li Y., Sugiura M.
- EMBO J. 9:3059-3066(1990).
- [ 7] Ayane M., Preuss U., Koehler G., Nielsen P.J.
- Nucleic Acids Res. 19:1273-1278(1991).
- [ 8] Kawakami A., Tian Q., Duan X., Streuli M., Schlossman S.F., Anderson P.
- Proc. Natl. Acad. Sci. U.S.A. 89:8681-8685(1992).
- [ 9] Minvielle-Sebastia L., Winsor B., Bonneaud N., Lacroute F.
- Mol. Cell. Biol. 11:3075-3087(1991).
-